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Abstract. A folk theorem says higher order arithmetic has the proof theoretic 
strength of set theory with limited power set. This paper makes the theorem 
precise in terms of several axiom systems based on ZF as well as systems based 
on the elementary theory of the category of sets. 
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A folk theorem says n-th order arithmetic Z n has the proof theoretic strength of 
set theory with a restricted power set, but no precise statement has been published 
for n > 2 and no proof. This paper specifies axioms for ZF[n] based on ZF and 
for ETR[n] based on the elementary theory of the category of sets (ETCS), each 
positing that the natural numbers have n successive power sets. In other words 
each asserts that 1\ n exists where as usual 

Ho = H and "2 n+1 = 2 3 " 

Obviously ZF[n] interprets ETR[n] and both interpret Z n+2 - We show Z n+2 inter- 
prets stronger theories ZF[n*] and ETR[n*] positing for every set A smaller than 
3„ + i and every set B whatever there is a set of all functions A— !•£?. Here to say 
A is smaller than B, written A < B, means there is a one-to-one function A— 
Each of these theories interprets itself plus a global well ordering relation. These 
extensions are useful in applications as they obviate some appeals to power sets. 

1. The set theories ZF[n], ZF[n*], and ZFG[n*] 

The set theory ZF[0], often called ZF— , is Zermelo-Frankel without power set. 
The axioms are: 

• Extensionality: \/z{z <Ex-^z£y)—>x = y. 

• Empty set: 3yVz—i[z G y]. 

• Foundation: 3a(a G x) — > 3y(y G x A ->3z(z G y A z G x)). 

• Pair set: 3vjiz[z G w (z = x V z = y)]. 

• Sum set: 3u\/z[z G U 3y [z G y A y G x)] 

• Infinity: 3x[0 G x A Vy(y G x — > y U {y} G x)]. 

• Replacement: For any formula <j)(x,y) in the language of ZF: 

Wx(x G w — > 3ly <p(x, y)) — > 3v My (y G V O 3a; (x G w A <fi(x, y)) 
Examples show how replacement does some things commonly done by power set. 
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Theorem 1.1. (In ZF[0]) Any two sets have a cartesian product AxB. 

Proof. Form Kuratowski ordered pairs by repeated pair sets. For any sets a, B, 
replace each b G B by (a, b) to get {a} x B. Then replace each a G A by {a} x B to 
get {{a}xB | a 6 A} with sum set AxB = {(a,b) \ a e A,b e B}. □ 

Theorem 1.2. (In ZF[0]) Every equivalence relation RC.AxA has a quotient set 
of equivalence classes. 

Proof. Replace each a G A by its set of relata R a = {a' G A\ (a, a') G R} to get the 
set of equivalence classes A/R = {% C A\ 3a G A (w = R a ) }. □ 

Theorem 1.3. (In ZF[0]) There is a unique setu satisfying infinity plus induction, 
call it the set of natural numbers. 

06wA My(y G u -> y U {?/} £u) A 

Vcr C u [V G cr A Vy(y € cr -> y U {?/} G cr) -> cr = uj] 

Proof. Given x as in the axiom of infinity take Dedekind's subset: 

W = {z G at | Vcr C x (V G cr A Vy(y G cr — > y U {y} G cr) ->• z G uj) } □ 

Theorem 1.4. (In ZF[0]) Every set A has a set A <ul of all finite strings and a set 
Fin(j4) of all finite subsets of A. 

Proof. Replacing each n G uj by the product set A n gives {A n \ n G ui} with sum 
set A <ul . Replacing each n-tuple by the set of its entries gives Fin(A). □ 

The ZF[1] axioms further posit ui has a power set V(uj) = 3 1? while ZF[2] posits 
a power set W(oj) = H2, and so on for all ZF[n]. Even before seeing the precise 
axioms of Section [3] it is clear ZF[0] interprets 2nd order arithmetic Z 2l and more 
generally ZF[n] interprets (n + 2) order arithmetic Z( n+2 ). 

Theorem 1 1 .41 showed ZF[0] implies every set has a set of all its finite subsets. To 
the contrary, ZF[1] does not imply every set has a set of all its countable subsets. 
Indeed for no n > does ZF[n] imply every set has a set of all countable subsets, 
let alone all subsets which are smaller than Lemma 1 1 . 61 shows this. 

Lemma 1.5. ZF[n] for < n proves these statements are equivalent: 

(1) Every set A has a set of all its subsets smaller than 1\ n . 

(2) For every set A, and every B < H n there is a set of all functions _B— > A 

Proof. 1 implies 2 since the graph of a function B — > A is a subset of B x A the size 
of B. And 2 implies 1 as every subset of A smaller than D„ is the image of some 
function B A with B < 3„ . □ 

For each n G N define ZF[n*] to be ZF[n] plus either of the conditions in 
Lemma [1.51 If ZF is consistent then ZF[n] does not imply ZF[n*] when n > 0. 
Weaker consistency assumptions would suffice but I am not concerned with that: 

Lemma 1.6. ZF proves for every 1 < n G N there is a model of ZF[n] where not 
every set has a set of all its countable subsets. 

Proof. Using ZF and working in L we can assume GCH, so D„ = K„. Consider the 
set Hx^+i of all sets hereditarily smaller than K w + 1. This models every ZF[n]. 
But it also includes H w and all its subsets, while Kdnig's inequality shows the set 
of countable subsets of H w is larger than H w . □ 
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On the other hand there is an inner model: 
Theorem 1.7. ZF[n] has a well ordered inner model of ZF[n*]. 
Proof . The inner model L of constructible sets works in ZF[0] nearly as in (ICohenl . 



1966() . Only notice that given L a (Cohen's M a ) we can get L a+ \ without using 
power set, by replacement on the finite strings of elements of L a and the set of 
formulas of ZF. Axioms other than power set are verified in Cohen's way and his 
argument works in ZF[0] to show that if a set A in L has an actual power set then 
some set in L is a power set for A within L. So ZF[n] proves L is an inner model 
of ZF[n] and the usual relations are absolute. 

Cohen's verification of choice in L defines a well-ordering of L without using 
power sets, so we can assume choice. As to GCH, Cohen's argument works to show 
ZF[n] + (V — L) proves D„ = H„, and that every subset of D„ is constructed by 
some ordinal with cardinality H rl . So ZF[n] + (V = L) proves if H„ + i exists it is 
3„ + i. By coffnality, it proves that when Hi < A < H„ + i then the set of all subsets 
of A with cardinality less than A is in bijection with A. So ZF[n] shows L restricted 
to sets hereditarily smaller than H n+ i satisfies ZF[n*]. □ 

For each n G N define ZFG[n*] to be ZF[n*] plus a global well-ordering of sets. 
That is extend ZF[n*] by a linear order y < 7 z on sets and an axiom scheme saying 
every non-empty class has a < 7 -minimal clement; and extend the replacement 
axiom scheme to include formulas using this relation. 

Corollary 1.8. ZF[n] has an inner model of ZFG[n* J. 



2. The categorical set theories 

The categorical set theory ETR[0] is ETCS without the axioms of function set 
or choice, but adding a replacement scheme. Dropping function sets means these 
axioms do not describe a topos. The axioms are: 

• Like a topos: Sets and functions form a category with all finite products, 
coequalizers, and a subset classifier Q. 

• Nontriviality: Besides true: 1— s-fi there is a distinct element false: l^il. 

• Infinity: There is a natural number object N, 0, s. 

• Extensionality: For any parallel pair of functions f,g:A—^B if f(x) = g(x) 
for all x € A, then / — g. 

• Replacement: For each relation R{x, h\ , . . . , h n ) of functions in the lan- 
guage of ETCS this is an axiom: 

Va; G A 3!j(/ii, • • • , h n ) (R(x, hi, . . . , h n )) 

3(hi, . . . , h n ) \fx G A (R(x, x*h\, . . . , x*h n ) 



With the axiom of choice, the replacement axiom of iMcLartvl (|2004l ) implies 



this one. But not without choice. The quantifier 3!^ reads "exists uniquely up 
to isomorphism." The idea is that a finite list of functions (meeting some given 
conditions) gives a finite diagram (of some shape). The notation x*h for x G A 
says h is a function over set A and x*h its pullback over x: 
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The axiom says that if each element of a set A is i?-related to a finite diagram (of 
a given shape) unique up to isomorphism, then there is a diagram (of the same 
shape) of sets over A such that its pullback over any x € A is i?-related to x. 
So separation holds for arbitrary formulas, not only bounded formulas: 

Theorem 2.1. (In ETR[0]) For each property P{x) of arrows this is a theorem: 

VA 3S 3 a monic i:S^A [Vx G A (x e S o P(x))] 

Proof. Use replacement on the relation R(x, h) given by 

( P{x) & h = x as monies to A) V (-iP(x) & ft. = 0— »• A as monies to A) □ 

Separation immediately implies every parallel pair of functions f,g:A—+B has an 
equalizer defined by f{x) — g(x). Existence of coequalizers shows every equivalence 
relation R>^Ax A has a quotient A^-A/R. The familiar argument by extension- 
ality in a well-pointed topos works in this context as well to show true, false are 
the only elements of Q and their intersection has no elements. Separation then 
shows the empty set is initial in the category of sets and functions, 51 is the 
coproduct 1 + 1. Replacement on the two elements of 1 + 1 proves all sets A, B 
have a coproduct or disjoint union A + B. 

Separation shows every subset of any set has a Boolean complement, and every 
two subsets of any set A have an intersection and union. Further, for any relation 
r : R>— > AxB there are subsets of A: 

{x S A\3y e B{R{x, y)} and {x E A|Vy £ B(R(x, y)} 

Theorem 2.2. (In ETR[0]) Every set A has a set A <u of all finite strings, indeed 
an N-indexed set A <u — >N assigning each string its length. 

Proof. Essentially like Theorem II .41 □ 

The ETR[1] axioms posit N has a power set V(N) = 3i, while ETR[2] posits a 
power set VP(N) = D2, and so on. The stronger ETR[n*] posits for any A with a 
monic A>— >3„+i and any set B there is an exponential B A , while ETWO[n*] adds 
a global well ordering relation. ZFWO[n*] obviously interprets all of these. 

Clearly ETR[n] interprets Z n+ 2- So the following section concludes the proof 
that all of these theories interpret one another. 



Our n-th order arithmetic Z n uses a language without product types, and with 
terms of order <n. So first order arithmetic PA= Z\ has natural number terms of 
order 0. Second order Z2 adds terms of order 1 for classes of natural numbers, while 
Z n has class terms of all orders below n. We write Vj or 3j to quantify over order 



3. Interpreting ZF[n] in Z n+2 
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i variables. We adopt the Peano axioms in order 0, except for induction which we 
state with an order 1 class variable: 

Vict [(0 G a A V y(y e a ^ S(y) G a) -> V y (y G a) ] 

We adopt the impredicative comprehension scheme for each order i < n — 1: 

El. i+:L X Vxi ) 

for any formula <p(x) with X not fre e . Thi s is the system of ISimpson 

(HOIS p. 4) 

extended to higher order, or iTakeutil (|1987l pp. 192) omitting product types. 

3.1. j-indexing. We adopt any encoding of finite sequences of natural numbers 
by natural numbers. To encode each (finite or infinite) sequence of classes of order 
i > as a single class of order i we use the idea of j-indexing. Here i is not a term 
of Z n +2 but of the language in which we discuss Z n +2- We cannot quantify on it 
in In contrast j is an order variable which we can quantify in Z n +2- 

Precisely, in Z„ +2 , use any standard coding of pairs of natural numbers to define 
indexing operators *j by recursion (recursion in our language, not in Z n+ 2): 

j *o k is defined as (j, k) and j *i x is defined as {j y \ y G x} 

In effect j *j x is the class built up from j-indexed numbers just the way x is built 
up from (un-indexed) natural numbers. Clearly Z n+2 proves 

j*iX = j-kiV -> x = y 

In Z n+ 2 a sequence of order i classes for i < n + 2, is any order i class where 
every element is j-indexed for some j. Formally: 

Seqi(x) is defined as Vi_iy (y e x ->• 3 j 3i-iy' {y = j *< y'))) 

The j-component Compi(j, x) of any order-i class x is the subclass of x made of 
j-indexed numbers. Formally: 

Compi(j, x) is defined as {y G x 3n-iy' (y = j *i y'))) 

So a sequence of order i classes is determined by its j-components for all j. 

Lemma 3.1. In any Z n+ 2 and for any i < n + 2, every sequence of order i classes 
is the union of its components. 

3.2. Singleton inclusion. In Z n+ 2 every order i from to n has a natural image 
in the next higher. We point out this image is defined by a formula Singi(x,y) 
with variables x, y of orders i and i + 1 respectively: 

Singi(x, y) is defined as \/iw (w G y ■<-> w = x) 

Again i is not a term of Z n+ 2 and we cannot quantifier on it in Z n+ 2- 

Lemma 3.2. The singleton embedding commutes with j-indexing: 

V j ( Singi(x, y) -> Sing^j x, j y) ) 

So singleton image preserves sequences and their j-components for each j. This 
will mean that the singleton image of an interpretation of ZF[ Ti] in Z n j^2 is an 
interpretation of ZF[n] in ^„+3 which we will extend to interpret ZF[n + 1]. 

3.3. The interpretations. We follow lSimpson (2010. §VII.3, pp. 258ff.) interpret- 
ing ZF[0] in Z2. Some exposition will motivate the higher order extension. 
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3.3.1. The Z2 case. Working in Z2, interpret a ZF set as a tree, where each node 
represents a set with one element for each daughter node. In turn a tree is coded 
as a class of finite sequences of numbers which includes every initial segment of 
any sequence in it. Each sequence is considered a daughter of its initial segment 
missing just the last entry. For an example using strings of one-digit numbers take 
this class of sequences where [] is the empty sequence: 

T={ 0, 0, 1, 10, 2, 20, 21, 210} 

This encodes a downward growing tree: 




Three nodes below [] show this represents a three element set. No nodes below 
the leftmost node shows this node represents the empty set. Altogether this 
tree encodes the von Neumann ordinal 2, that is {0, {0}, {0, {0}}}. Define the 
standard tree for a finite von Neumann ordinal n as the class of all strictly decreasing 
sequences of natural numbers that begin with some j < n. 

For the class of all strictly decreasing sequences of natural numbers, each branch 
remains finite but their lengths are unbounded and the tree is infinitely wide. This 
represents the von Neumann natural number set N. So the tree interpetation in Z2 
satisfies the axiom of infinity. The empty set axiom is verified by the tree with no 
nodes beyond the empty sequence []. 

We must say which trees represent sets, by means expressible in Z^: 

Definition 3.3. A class a of sequences of natural numbers is a suitable tree if: 

(1) a contains every initial segment of any sequence in a, 

(2) the empty sequence [] is in a, 

(3) it is well-founded: no infinite sequence has all its initial segments in a. 
Consider the suitable classes {[],0, 1} and {[],2} encoding these trees: 



At a hasty glance the left tree represents a two element set but both its nodes 
represent the same set, namely the empty set. We want to construe it as represent- 
ing the same set {0} as the tree on the right. We need an equivalence relation on 
suitable classes to represent equality of sets, and we must use this relation to define 
a relation on suita ble classes representing set elementhood. 

bimpson (|2010L p. 263) handles this in Z2 and shows the fragment ATRq proves 



these interpret much of ZF[0]. Of course ATRo is designedly weaker than replace- 
ment but the verification in Z2 is straightforward: For any relation expressible in Z2 
associating a tree to each daughter of [] in a given tree, Z2 comprehension suffices 
to produce a tree replacing each daughter of [] by the corresponding tree. 
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3.3.2. Lifting to Z n +2- The above construction works as well in Z3 to interpret 
ZF[0] in order 1 (indeed to give a model, a class of order 2, but we will not use 
that fact). This interpretation will assuredly not extend to ZF[1] as there are no 
uncountable classes in order 1. So we adapt definition 13.31 to Z3: 

Definition 3.4. A class a of sequences of classes of natural numbers is a suitable 
Z3 tree if: 

(1) a contains every initial segment of any sequence in a, 

(2) the empty sequence [] is in a, 

(3) it is well-founded: no infinite sequence has all its initial segments in a. 

The singleton image of our first interpretation consists of suitable Z3 trees where 
all the sequences of classes of natural numbers are sequences of singleton classes. So 
extend it to all suitable Z3 trees, with the corresponding interpretation of equality 
and membership of sets. 

This satisfies ZF[0] by the same reasoning as before. To show it satisfies ZF[1] 
we need some suitable Z3 tree representing the power set of the natural numbers N. 
The simplest is the class of all initial sequences of any sequence a'{n}'a where a is 
a subset of N, n is in a, and a is the singleton image of a sequence in the standard 
Z2 tree for the von Neumann ordinal n. 

The extension to ZF[n] in Z n+ 2 for all n is clear. The chief thing is to replace 
each natural number in the Z2 interpretation of ZF[0] by the n-fold singleton of the 
singleton of . . . of that number. 

References 

Cohen, P. (1966). Set Theory and the Continuum Hypothesis. W.A. Benjamin. 
McLarty, C. (2004). Exploring categorical structuralism. Philosophia Mathematical 
pages 37-53. 

Simpson, S. (2010). Subsystems of Second Order Arithmetic. Cambridge University 
Press. 

Takeuti, G. (1987). Proof Theory. Elsevier Science Ltd, 2nd edition. 



